02/06/2024
Game theory studies strategic interactions
and multiperson decision theory
Defination - Normal form game
A normal form game consists of three things:
A set of
A set of actions
And the action profiles:
Payoffs.
Example
A game of
, , and
7,3 3,5 5,5 7,2 The payoff here is (player 1, player 2)
Defination (strictly) dominate
Action
对所有的player 2 选择的strategy,player 1选择
Definate weakly dominate
Action
In summary, strictly domination implies weak domination, and any two actions cannot weakly dominate each other.
And we have some descriptions:
Action
Action
Action
Action
Then,
A rational player will never chooses a dominated action, and will always chooses a dominant action.
Example
Player | 2 | ||
---|---|---|---|
Player | |||
1 | |||
if we do not know the payoffs for player 2, then we have no dominant and dominated actions so far.
Defination - Beliefs
Let
Tip
Here
Example
The expected utility
Given
Since the set of actions are finite, there is always at least one best action.
Defination - Best response
Action
Note
The best response correspondence is
Best response could be multiple.
Tip
Example - Sun Rain Game
Rain God | ||
---|---|---|
Player | Sun | Rain |
No Umbrella | ||
Umbrella |
Let
Find the best response correspondence given
Therefore, if
Otherwise, the choice of
and if
Defination - Never a best response
If
Proposition
A strictly dominated action is a NWBR.
Proof
If action
is strictly dominated by an action , , so , so action is a "Never a best response (NWBR)".
Tip
NWBR does not necessarily means being strictly dominated by a pure strategy. It could be dominated by a mixed strategy.
Example
U | D | |
---|---|---|
L | 3 | 0 |
M | 1 | 1 |
R | 0 | 3 |
There is no strictly dominated action,
let's say
Let's compare the expected utility given belief of player 2, for action
When
When
SO in this case,
Why? because M is dominated by a mixed strategy.
A mixed action for player 1 is a probability distribution
The support of a mixed action is the set of pure actions given positive probability.
The expected utility (payoff) of
Suppose
This means that a mixed action cannot yield a payoff higher than the best pure action in its support, since the paoff of the mixture is a convex combination of payoff of the pure action in the support.
Tip
But the mixed action is also something we needed, since it is conditional on the belief, it could be different conditional on other beliefs.
Warning
mixed strategy as best response
Suppose we extend the defination of the best response to mixed actions.
Then we have a following proposition, if we have
then every
Proof as homework
Suppose we have
hence,
Therefore, we can show
That brings a contradiction with
So, every
Note
Action
Proposition
Proof as homework.
We want to show if
Suppose
Suppose
Equilibrium is in undominated strategies. We have already known that a rational player will not play a dominated action, sometimes this fact by itself is enough to give a prediction for the solution of the game
Example
Each player has only one action that is not strictly dominated , so
is the solution.
Warning
Solution is always a pair instead of a strategy
Example - Sun rain game
There is no strictly dominated action for player 1,
is strictly dominated by , so is dominant. Player 2 will always plays . Then player 1 knows that Player 2 is rational and it will never play a strictly dominated action. Hence we can reduce the game by eliminating the strategy
for player 2. So
is the outcome for this game
Notice that we obtain this solution by iteratively deleting the strictly dominated actions.
Let's see what happens if we delete weakly dominated actions
strictly dominates , so we can eliminate the strategy for player 1. P1 will never players .
![]()
and can be the solution of the game But if we start with player 2 and eliminated the weak dominated strategy
, then it will only result in one solution
Warning
Eliminating weakly dominated strategy could be risky of losing solutions, and the final solution depends on the orders of deleting. So we never eliminate weakly dominated strategies.
Consider the following case
P1\P2 D E F A B C Starting P1
is weakly dominated by , is weakly dominated by , and is also weakly dominated Then we have
and as the solution.
Nash equlibrium
A pure action profile
That is
A mixed action profile
That is
Example
Prinsion Dilemma:
Consider the best response,
for player 1,
, and for player 2, . So
is a Nash equilibrium. We do not have a Minxed strategy nash equilibrium in this example, because strategy
is dominated by both agents, so is a never a best response. So any mixed strategy that has positive probability on will be dominated as well and becomes a NWBR.
Example - Sun rain game
Consider this example again.
Player A
, . Player B
, . So the pure strategy nash equilibrium is
. Now, what about if we have this payoff?
In this new payoff game, we have two pure strategy nash equilibrium
and . And we also have a mixed strategy nash equilibrium.
Proposition
Tip
Any action played positive probability in any NE must survive iterated deletion of strictly dominated actions.
Proof as homework
Suppose there is an action that is played positive probability in a NE
Looking for mixed action equilibrium
Example - Battle of the sexes
Two pure strategy nash equilibrium
, and . Suppose the probability of your partner choosing
is , and your probability of playing is . If you are mixing then you must be indifferent between
and which implies conditions on the action that your partner is playing So, for you, the expected util of playing
is , and . Similarly, your partner must be indifferent between
and to be willing to mix.
. Hence,
is the mixed strategy nash equilibrium in this example.
,Example - matching pennies
This is a zero sum game:
for all and . In this case, we do not have any best response to best responses. So we do not have any pure strategy nash equilibrium. But this does not imply that we cannot find any mixed strategy equlibrium, we need to check it.
Suppose
is a mixed action NE. Then,
. Similarly, we can calculate
. Therefore, we can find a mixed strategy nash equilibrium
.
Tip
Even if we cannot find a pure strategy nash equilibrium, it is possible to find a mixed strategy.
Typically we will find an odd number of nash equilibriums.
Example - Cournot duopoly
There are two firms each of whom chooses a quantity.
. Let , The payoffs are
is the price of , which is inverse demand, and is the cost of production. Let
, , for each and , The best response corresponces are
Suppose there exists an interior solution,
FOC:
Therefore,
and
otherwise. Then substitute it back to the b est response
This is a simultaneously move game
Question
Suppose that a game has two NE. Suppose one of the NE involves weakly dominated actions and the other doesn't, which one is less likely to be played?
Or suppose one of them is Pareto dominated by another NE. which one is less likely to be played?
Let's have an example for illustration,
L | R | |
---|---|---|
T | ||
B |
The PSNEs are
And we know that
And if we see the both players' strategies, we know that
Then if we consider the 1st question, say maybe we choose
Tip
Theorem
Any finite game has at least one NE.
Proof as homework
Here is a short self-contained proof.
We will define a function
Given a mixed strategy profile
The expected utility of player
For
Clearly,
with equality achived only if
JR Theorem 7.2 Proof
Proof: Let
We shall show that a Nash equilibrium of
Step 1: Define
Let
Step 2: Because the numerator defining
Step 3: Because
or
Multiplying both sides of this equation by
Now, a close look at the left-hand side reveals that it is zero, because
where the first equality follows because the
But the sum on the right-hand side can be zero only if
Theorem 7.2 is quite remarkable. It says that no matter how many players are involved, as long as each possesses finitely many pure strategies there will be at least one Nash equilibrium. From a practical point of view, this means that the search for a Nash equilibrium will not be futile. More importantly, however, the theorem establishes that the notion of a Nash equilibrium is coherent in a deep way. If Nash equilibria rarely existed, this would indicate a fundamental inconsistency within the definition. That Nash equilibria always exist in finite games is one measure of the soundness of the idea.
Recall the battle of sexes game,
We know that the two PSNEs are
Suppose the man need decide first, then
Integrents of extensive form game
Players
Nodes:
Decision nodes
Terminal nodes
Directions
predecessors and successors
Each node (except for initial node
Terminal nodes have no successor.
The game tree is the initial node
A subtree is any node and its successors
Whose decision function
Actions:
Each different act8ion leads to a different successor node.
Information sets
A player cannot distinguish between nodes in an information set but can distinguish between information sets.
The same players at each decision node in an information set, if
The same actions are available at each decision node in an information set, i.e. if
Nature's moves: nature moves randomly according to commonly known probability
Payoffs
Properties of extensive form game
A game has perfected recall if players don't forget anything during the game
A game has perfect information if each information set contains only one node: that is if playersknow the history of moves so far.
For example in this game we have four information sets, three of them contains more than one nodes, so this game is not a perfect information game. It is simultaneously move game, which means any player does not know the other player's action.
A game has complete information if the structure of the game is common knowledge (each player could draw the game tree).
we will usually transform incomplete information games into imperfect information games by letting Nature choose the structure randomly and unobservably.
Important
Complete info and Perfect info
Strategies and payoffs
In extensive form game, we will distinguish strategies and actions. we don't do that for normal games.
Let
The pure strategy for
Let
A strategy is a complete contingent plan of actions.
A pure strategy specifies a player's choice of action of each of her information sets, even for sets that are not reached.
Tip
Example: In this game we have 3 information sets for 2 players.
Then we define Mixed strategy ,
A mixed strategy
For example,
We know that we can transform extensive form game to normal form game by writting strategies and their corresponding payoffs. Then using this table we can calculate the mixed strategy nash equilibrium.
A Nash equilibrium of an extensive form game is a NE of the corresponding normal form game.
Refinement of NE
Backward Induction
is the process of analyzing a game from end to begin.
Example
A parent is driving Disneyland with a child in the back seat. The child is making a lot of noise. Parent says "be quiet, or I will turn this car around." This situation is represented in the following extensive form game, where the child's payoffs are given first.
Write down the norm form game:
TT TD DT DD Q -10,-5 -10,-5 5,10 5,10 N -5,-10 10,5 -5,-10 10,5 There are three pure strategy nash equilibrium here in the norm form game,
. By back ward induction, the remaining strategy profile is
. The strategy profile
is a NE. But is th e parents actually willing to turn back at node (left lower node)? No So is that a credible threat? No
Conditional on reaching node
, Parent's BR is Disneyland. The same is true at node . Anticipating these decisions, child know that if he chooses quiet he will end up with payoff of 5 and if he choose noise he will end up with payoff of 10. The strategy profile
is the backward induction solution to this game and it's also a NE.
Important
Proposition
Every backward induction solution of an extensive form game is NE.
Every finite perfect information game has at least one backward induction solution in pure strategies. Thus every such game has a PSNE.
But backward induction solutions may involve weakly dominated strategies.
Example
Consider this game:
, . Consider the following normal form game:
C D A 1,1 1,1 B 1,1 0,0
and are both solutions to this game, but is weakly dominated.
Example
This is an imperction game, so backward induction cannot be applied. BUT we can still eliminate strategy
because it is strictly dominated. If we change the payoff to this scheme:
We need subgame perfect
Defination
A subgame is a sub tree such that
Starts at a decision node and
Contains no broken information set
That is if an information set contains a node in the subgame, then every node in that information set is contained in that subgame.
Example:
This game has 2 subgames. One starts from
and the other starts from .
Defination - SPNE
A subgame perfect equilibrium is a profile of strategies where restrictions to any subgame forms a NE of that subgame.
Note
Every subgame perfect equilibrium is a NE (To the whole game), if a game has only one subgame, which is itself, then every NE is a subgame perfect Nash Equilibrium (SPNE).
Tip
In games of perfect information (we do not have any info set that contains multiple nodes, i.e. no simultaneous move game), the set of subgame perfect equilibria and the set of Backward induction are the same
Our backward induction is actually deriving a SPNE.
The advantage of subgame perfection is that it is defined even for games with imperfect information.
TO find a SPNE in an imperfect information game, the procedure is similar to backward induction. we just replace subgames at the end of the game with their equilibrium payoffs, and repeat until you reach the initial node.
Example
There are two subgames in this game.
For the second subgame, the normal form is,
P3 L R Player 2 T 0,1 1,0 B 1,1 0,0
is the unique aNE of subgame strategies from . Then the payoff at node
degenerates to , then apply backward induction, palyer 1 will choose , and the SPNE is . Next, consider the following exercises: find all the Nash equilibriums in this game (Hint: rewrite this game into a normal form game, we typically need two matrixes)
When
is played by P1,
L R T 1,10,10 ✅ 1,10,10 ✅ B 1,10,10 1,10,10 ✅ When
is played by P1,
L R T 0,0,1 0,1,0 😂 B 2,1,1 ✅ 0,0,0
Example
In this game, by backward induction, the SPNE is
. but we do have other Nash equilibriums.
Let's add a couple of actions to the battle of sexes.
We have two pure strategy nash equilibriums.
Now, let's augment some of the choices, adding
O | B | C2 | |
---|---|---|---|
O | 2,1 | 0,0 | 6,0 |
B | 0,0 | 1,2 | 0,0 |
C1 | 0,0 | 0,0 | 5,5 |
Still, we have the same original two nash equilibrium. Adding
But consider the game
The payoffs from
There exists a SPNE in which
, and in the second stage
They represents your strategy in the first and second stage,
The following strategy form a SPNE of
We consider this repeated game using SPNE, in the second stage, the SPNE is
First, note that this strategy profile is a Nash equilibrium of the whole game. If man follow the equilibrium he gets payoff
For the woman, she does not has any incentive to deviate. She plays a statistic best response in both periods.
The equilibrium is a SPNE because both
Now, lets define this game formally.
Defination
Given a normal form game
A history
Like
A pure strategy for player 1 is
The outcome of a pure strategy profile
Like the previous example,
, We define like
, . Then we can also write in this way:
.
The value
Definations
The payoff of player
like,
. It will sometimes be convenient to rescale these payoffs so that they are directly comparable do the stage game payoff
From now, we do not asuume
. If
, then, , and . if
. say
Tip
Finitely repeated games
Consider prisoner's dilemma. we repeat it for
C | D | |
---|---|---|
C | -1,-1 | -4,0 |
D | 0,-4 | -3,-3 |
There is a unique NE for stage game.
Here becasue each subgame only have one nash equilibrium, so when we do backward induction, we can degenerate the game tree always with a certain equilibrium. So there will be no room for punishment to deviate from the nash equilibrium at each stage.
Formally, we say in any subgame perfect equilibrium of
Proposition
Suppose that stage game
if the stage game has more than one NE, then the finitely repeated game has more than one subgame perfect equilibrium. For example, for each
Given a normal form game, let
The minmax payoff
That is the other players are minimizing the value of
The minmax strategy that they (the other players) use against
Proposition
For any
Note
Proof idea
Player
Tip
Example
Since
For
Tip
Example
Think about the mixed strategy.
To find
if
which means that
Infinitely Repeated Games
The principle of optimality (also known as the one shot deviation principle) is most useful when we deal with in finitely repeated games Principle of optimality
A strategy profile
The principle of optimality makes checking for subgame perfection much easier by reducing the number of deviations that we need to check. The idea is that if there is any profitable deviation, then there must be profitable one-shot deviation.
Tip
Example
Strategy:
if no one has deviated, Man's equilibrium continuation payoff is
Since any deviation results in the same future play, Man's best deviation is his best static deviation is
Then the payoff is
Then we look at the woman's equilibrium continution, woman's best deviation is either
In any history after a deviation, man's equilibrium continuation payoff is
Refinement of SPNE
In this game we have two SPNE, pure strategy,
But for player 2,
So, even if
We ought to require that at nontrival info sets, players maximized expected utility according to some beliefs about which node they are at.
Tip
Example
There are two subgames.
We know that
Let decision node
Tip
Understanding in this way:
In our analysis, we allow nature
to be the the first player in the games in which we have nature. Given the strategy profile
Defination
[Blelief] An assesment is strategy/condtional belief pair
Note
A system of beliefs
A system of beliefs specify, for each information set, a probabilistic assessment by the player who moves there, conditional upon play having reached that information set.
Let
[Sequentially rational] An assessment
That is
Note
A formal statement
A strategy profile
for all
An assessment
it is sequential rational , and
belief
Note
Proposition
if
Proof by intuition
Suppose
Therefore, denoting by
Hence, given belief
This inequality condition violates the defination of sequential rationality. So in that case, if
Therefore, if
Tip
Example
In order to find PBE, we know it must be a subset of SPNE, so we can first find the set of SPNE.
In this example, we only find one SPNE, so it must be part of PBE.
And suppose player 2 know that player 1 will always choose
When
Another example,
we know that in this game
We know that
if
Tip
Example
In this game, player 2 only knows the player 1 chooses
We only have one subgame,
Each of the part,
The NE of this game is
They are SPE,
Then,,
Homework
Consider the following sealed -bid first price auction,
Sealed-bid, no knowledge about the other participants
first price: the bid with highest price wins.
single object
only two allowable bids
two risk neutral bidders (maximizes expected payoff)
Bidder
Each bidder views her valuation and then simultaneously and independently submits bid of either
higher bidder wins and pays her bid
ties broken by fair coin.
[Solution]
There must be a threshold
At
Suppose 2 bids according to
the payoff
then we can calculate the expected payoff for player 1,
And,
Then player 1 is indifferent vetween bidding
and player 2 is indifferent between bidding
Defination
Let
Standard Auction Formats
First price auction: Each bidder
Second price auction: highest bidder wins and pays the second highest bid.
English auction: Bidder dynamically submit successively higher bids. Final bidder wins and pays her final bid.
Dutch auction: Auctioner starts at a high price, successively announces lower price until someone bids. The lowerst bidder wins and pays the current price;
Homework
Intuitively explain the difference and simularities between Dutch, English and first price.
[Solution] of the sealed bid second price auction
Consider the following cases,
Bidder | Bidder | Bidder | |
---|---|---|---|
Playing Bidder
A seller owns one unit of an indivisible good, the good is worthless for the owner unless she can sell it. There are two buyers, 1 and 2. Assume that buyer's valuation of the good is private information. Available information for the players
Consider the first price sealed bid auction. Buyers strategies depends on their valuation.
Let
A BNE is a pair of strategies such that
Then,
Then,
Then the maximization problem becomes,
FOC:
Then,
Then
Then the optimal bidding strategy is
The expected payment of winner:
The expected gain for the seller:
Homework:
Find the expected payment of winner, expected gain of winner, and expected seller gain under second price sealed bid auction.
The optimal strategy for each bidder is to bid
, if we assume each bidder's valuation Suppose there are only two bidders in this auction. Denote
as two players' valuation.
expected payment of winner
Expected gain of winner
Expected seller gain
A worker's random type or ability
Firm complete over wages to hire the worker. The firms observe education
The cost of attaining education level
Assumption on
Payoff to a worker of type
The profit to the firm is
We look for symmetric PBE in pure strategies, That is we look for 3 functions,
To be PBE, the functions must satisfy the following conditions,
To simplify the notation, let
There are two types of PBE,
Pooling
Separating
Let's discuss the separating equilibrium first
Tip
In any separating perfect Bayesian equilibrium,
This is because upon firm seeing the education level
Similarly, upon seeing high education level
Then for low type workers, they will have